# Multimodal reasoning enhancement

InternVL3-38B-Instruct

InternVL3-38B-Instruct is an advanced multimodal large language model (MLLM) that demonstrates exceptional multimodal perception and reasoning capabilities, supporting various tasks such as tool usage, GUI agents, industrial image analysis, and 3D visual perception.

Tags: Transformers, Other

VisualPRM-8B-v1.1

VisualPRM-8B-v1.1 is a multimodal process reward model (PRM) with 8 billion parameters. It improves the reasoning of multimodal large language models through the Best-of-N evaluation strategy: several candidate responses are sampled, and the one the reward model scores highest is kept.

Tags: Multimodal Fusion
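
The Best-of-N strategy named in the VisualPRM description can be illustrated with a minimal Python sketch. It is not the VisualPRM API: `best_of_n` and `toy_step_scorer` are hypothetical names, the scorer is a toy stand-in for a real process reward model, and averaging the per-step scores is an assumed aggregation rule chosen for illustration.

```python
from statistics import mean
from typing import Callable, Sequence, Tuple

# A candidate response is represented as a sequence of reasoning steps.
Steps = Sequence[str]


def best_of_n(
    candidates: Sequence[Steps],
    score_steps: Callable[[Steps], Sequence[float]],
) -> Tuple[int, float]:
    """Return (index, score) of the candidate with the highest mean step score.

    `score_steps` stands in for a process reward model: it returns one
    score per reasoning step. Taking the mean is an assumption made here,
    not a documented VisualPRM behaviour. Each candidate must contain at
    least one step, otherwise `mean` raises StatisticsError.
    """
    if not candidates:
        raise ValueError("need at least one candidate")
    scored = [mean(score_steps(c)) for c in candidates]
    best = max(range(len(scored)), key=scored.__getitem__)
    return best, scored[best]


def toy_step_scorer(steps: Steps) -> Sequence[float]:
    # Hypothetical scorer: a step that shows an explicit equation gets 1.0,
    # any other step gets 0.0. A real PRM would be a learned model.
    return [1.0 if "=" in step else 0.0 for step in steps]


candidates = [
    ["guess the answer", "answer is 12"],
    ["2 + 3 = 5", "5 * 2 = 10"],
    ["2 + 3 = 5", "so roughly 10"],
]
best_index, best_score = best_of_n(candidates, toy_step_scorer)
print(best_index, best_score)
```

In practice the candidates would be N responses sampled from a policy model such as InternVL3-38B-Instruct, and the scorer would wrap calls to the reward model.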

© 2025 AIbase